Anaerococcus pacaensis strain 9403502T is the type strain of Anaerococcus pacaensis sp. nov.,
a new species within the genus Anaerococcus. This strain, whose genome is described here,
was isolated from a blood sample. A. pacaensis strain 9403502T is an obligately anaerobic
Gram-positive coccus. Here we describe the features of this organism, together with the
complete genome sequence and annotation. The 2.36 Mbp long genome exhibits a G+C content
of 35.05% and contains 2,186 protein-coding and 72 RNA genes, including 3 rRNA genes.



Introduction 

Anaerococcus pacaensis strain 9403502T (= CSUR P122 = DSM 26346) is the type strain of
Anaerococcus pacaensis sp. nov. and a member of the genus Anaerococcus. This bacterium is
a Gram-positive, anaerobic, non-spore-forming, indole-negative coccus that was isolated
from a blood sample during a study prospecting anaerobic isolates from deep samples [1].

The "gold standard" method to define a new bacterial species or genus is DNA-DNA
hybridization and G+C content determination [2]. Those methods are expensive and poorly
reproducible, and bacterial species can currently be classified with PCR and sequencing
methods, particularly 16S rRNA gene sequencing with internationally validated cutoffs [3].
More recently, an increasing number of new bacterial genera and species have been
described using high-throughput genome sequencing and mass spectrometric analyses, which
give access to a wealth of genetic and proteomic information [4,5]. Previous studies have
described new bacterial species and genera using genome sequencing, MALDI-TOF spectra,
and main phenotypic characteristics [6-23], and we propose here to describe a new species
within the genus Anaerococcus in the same way.

Here we present a summary classification and a set of features for A. pacaensis sp. nov.
strain 9403502T (= CSUR P122 = DSM 26346), together with the description of the complete
genomic sequencing and annotation. These characteristics support the circumscription of
a novel species, Anaerococcus pacaensis sp. nov., within the genus Anaerococcus and
within the Clostridiales Family XI Incertae sedis.

The genus Anaerococcus was first described in 2001 [24] and belongs to the Clostridiales
Family XI Incertae sedis. This family is defined mainly on the basis of phylogenetic
analyses of 16S rRNA sequences, and all bacteria in the genus Anaerococcus are anaerobic
Gram-positive cocci. Based on the comparison of 16S rRNA gene sequences, the closest
related species to Anaerococcus pacaensis sp. nov. is Anaerococcus prevotii. It was first
described in 1948 by Foubert and Douglas [25] and later reclassified in the genus
Anaerococcus [24]. The second closest related species is A. octavius, which was first
described as Peptostreptococcus octavius, isolated from a human sample in 1998 by Murdoch
et al. [26]. It was later reclassified in the genus Anaerococcus as A. octavius [24].

Classification and features 

A blood sample was collected from a patient during a study analyzing emerging anaerobes
with MALDI-TOF and 16S rRNA gene sequencing [1]. The specimen was sampled in Marseille
and preserved at -80°C after collection. Strain 9403502T (Table 1) was isolated in July
2009 by anaerobic cultivation on 5% sheep blood-enriched Columbia agar (BioMerieux, Marcy
l'Etoile, France). This strain exhibited 95% nucleotide sequence similarity with
Anaerococcus prevotii [24,25]. This value is lower than the threshold recommended to
delineate a new species without carrying out DNA-DNA hybridization [38]. In the inferred
phylogenetic tree, it forms a distinct lineage close to A. octavius (Figure 1).
Table 1. Classification and general features of Anaerococcus pacaensis strain 9403502T

MIGS ID | Property | Term | Evidence code (a)
 | Current classification | Domain Bacteria | TAS [27]
 | | Phylum Firmicutes | TAS [28-30]
 | | Class Clostridia | TAS [31,32]
 | | Order Clostridiales | TAS [33,34]
 | | Family XI Incertae sedis | TAS [35]
 | | Genus Anaerococcus | TAS [36]
 | | Species Anaerococcus pacaensis | IDA
 | | Type strain 9403502T | IDA
 | Gram stain | Positive | IDA
 | Cell shape | Cocci | IDA
 | Motility | Non-motile | IDA
 | Sporulation | Non-spore-forming | IDA
 | Temperature range | Mesophile | IDA
 | Optimum temperature | 37°C | IDA
MIGS-6.3 | Salinity | Weak growth on BHI medium + 1% NaCl | IDA
MIGS-22 | Oxygen requirement | Anaerobic | IDA
 | Carbon source | Unknown | NAS
 | Energy source | Unknown | NAS
MIGS-6 | Habitat | Blood | IDA
MIGS-15 | Biotic relationship | Free living | IDA
MIGS-14 | Pathogenicity | Unknown | NAS
 | Biosafety level | 2 |
 | Isolation | Human blood sample |
MIGS-4 | Geographic location | France | IDA
MIGS-5 | Sample collection time | July 2009 | IDA
MIGS-4.1 | Latitude | 43.296482 | IDA
MIGS-4.1 | Longitude | 5.36978 | IDA
MIGS-4.3 | Depth | Surface | IDA
MIGS-4.4 | Altitude | 0 above sea level | IDA

(a) Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the
literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based
on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene
Ontology project [37]. If the evidence is IDA, then the property was directly observed for a live isolate by one of the
authors or an expert mentioned in the acknowledgements.



Different growth temperatures (23°C, 25°C, 28°C, 32°C, 35°C, 37°C, 50°C) were tested; no
growth occurred at 23°C, 25°C, 28°C or 50°C, growth occurred between 32°C and 37°C, and
optimal growth was observed at 37°C.

Colonies are punctiform, very small, grey, dry and round on blood-enriched Columbia agar
under anaerobic conditions using GENbag anaer (BioMerieux). Growth was tested on
blood-enriched Columbia agar (BioMerieux), in BHI broth medium, and in Trypticase soy
(TS) broth medium, under anaerobic conditions using GENbag anaer (BioMerieux), under
microaerophilic conditions using GENbag microaer (BioMerieux), and in the presence of air
with 5% CO2. Growth was also tested under anaerobic conditions on BHI agar and on BHI
agar supplemented with 1% NaCl. Growth was achieved only anaerobically, on blood-enriched
Columbia agar, and weakly on BHI agar and BHI agar supplemented with 1% NaCl after 72 h
of incubation. Gram staining showed round, non-spore-forming Gram-positive cocci
(Figure 2). The motility test was negative. Cells grown anaerobically in TS broth medium
have a mean diameter of 1.140 µm (min = 0.955 µm; max = 1.404 µm), as determined by
electron microscopic observation after negative staining (Figure 3).
[Figure 1: phylogenetic tree. Sequences shown: Anaerococcus lactolyticus CCUG31351
(AF542233), Anaerococcus murdochii WAL17230 (DQ911243), Anaerococcus prevotii DSM20548
(CP001708:81697-83211), Anaerococcus tetradius CCUG46590 (AF54223), Anaerococcus
pacaensis 9403502 (HM587325), Anaerococcus octavius NCTC9810 (Y07841), Anaerococcus
hydrogenalis GIFU7662 (D14140), Anaerococcus vaginalis CCUG31349 (AF542229), Clostridium
butyricum VPI3266 (AJ458420). Bootstrap values shown at nodes: 96, 99, 100. Scale bar: 0.02.]

Figure 1. Phylogenetic tree highlighting the position of Anaerococcus pacaensis strain 9403502T relative to other
type strains within the genus Anaerococcus. GenBank accession numbers are indicated in parentheses. Sequences
were aligned using CLUSTALW, and phylogenetic inferences were obtained using the maximum-likelihood method
within the MEGA 4 software [39]. Numbers at the nodes are bootstrap values obtained by repeating the analysis 500
times to generate a majority consensus tree. Clostridium butyricum was used as the outgroup. The scale bar
represents a 2% nucleotide sequence divergence.
Figure 3. Transmission electron microscopy of A. pacaensis strain 9403502T,
using a Morgagni 268D (Philips) at an operating voltage of 60 kV. The scale
bar represents 500 nm.



Strain 9403502T exhibited catalase activity but no oxidase activity. Using API 20A, a
positive reaction was observed only weakly for gelatinase. Using API ZYM, a positive
reaction was observed for alkaline phosphatase (5 nmol of hydrolyzed substrate), acid
phosphatase (5 nmol), naphthol-AS-BI-phosphohydrolase (5 nmol), and hyaluronidase
(40 nmol). Using API Rapid ID 32A, a positive reaction was observed only for
β-glucuronidase and pyroglutamic acid arylamidase. Regarding antibiotic susceptibility,
A. pacaensis was susceptible to penicillin G, amoxicillin, cefotetan, imipenem,
metronidazole and vancomycin. The phenotypic characteristics of A. pacaensis, compared
with those of representative species within the genus Anaerococcus, are detailed in
Table 2 [40].

Matrix-assisted laser-desorption/ionization time-of-flight (MALDI-TOF) MS protein
analysis was carried out as previously described [41]. A pipette tip was used to pick one
isolated bacterial colony from a culture agar plate and to spread it as a thin film on an
MTP 384 MALDI-TOF target plate (Bruker Daltonics, Germany). Ten distinct deposits were
made for strain 9403502T from ten isolated colonies. Each smear was overlaid with 2 µL of
matrix solution (saturated solution of alpha-cyano-4-hydroxycinnamic acid) in 50%
acetonitrile, 2.5% trifluoroacetic acid, and allowed to dry for five minutes.
Measurements were performed with a Microflex spectrometer (Bruker). Spectra were recorded
in the positive linear mode for the mass range of 2,000 to 20,000 Da (parameter settings:
ion source 1 (IS1), 20 kV; IS2, 18.5 kV; lens, 7 kV). A spectrum was obtained after 675
shots at a variable laser power. The time of acquisition was between 30 seconds and 1
minute per spot. The ten 9403502T spectra were imported into the MALDI BioTyper software
(version 2.0, Bruker) and analyzed by standard pattern matching (with default parameter
settings) against the main spectra of 5,697 bacteria in the BioTyper database. The
identification method uses the m/z range from 3,000 to 15,000 Da. For every spectrum, at
most 100 peaks were taken into account and compared with the spectra in the database. A
score enabled the identification, or not, of the tested species: a score > 2 with a
validated species enabled identification at the species level; a score > 1.7 but < 2
enabled identification at the genus level; and a score < 1.7 did not enable any
identification. For strain 9403502T, the best score obtained was 1.265, which is not
significant, suggesting that our isolate was not a member of any species in the database.
Our database was incremented with the reference spectrum from strain 9403502T (Figure 4).
A dendrogram was constructed with the MALDI BioTyper software (version 2.0, Bruker),
comparing the reference spectrum of strain 9403502T with reference spectra of 26
bacterial species, all belonging to the order Clostridiales. In this dendrogram, strain
9403502T appears as a separate branch within the genus Anaerococcus (Figure 5).
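
The score interpretation described above can be sketched as a small decision rule. This is only an illustration: the function name biotyper_level is hypothetical, and the treatment of scores exactly equal to 2 or 1.7 (counted here in the higher category) is an assumption, since the text does not specify the boundary cases.

```python
def biotyper_level(score: float) -> str:
    """Return the identification level supported by a pattern-matching score.

    Thresholds follow the text: > 2 species, 1.7 to 2 genus, < 1.7 none.
    Boundary handling (>=) is an assumption made for this sketch.
    """
    if score >= 2.0:
        return "species"
    if score >= 1.7:
        return "genus"
    return "none"


# Best score reported for strain 9403502T
print(biotyper_level(1.265))
```

With the reported best score of 1.265, the rule yields no identification, which is consistent with the isolate not matching any reference spectrum in the database.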
Table 2. Differential characteristics of Anaerococcus pacaensis sp. nov. strain 9403502T, A. octavius strain NCTC
9810T, and A. tetradius strain DSM 2951T.

Properties | A. pacaensis | A. octavius | A. tetradius
Cell diameter (µm) | 0.9-1.4 | 0.7-0.9 | 0.5-1.8
Oxygen requirement | Anaerobic | Anaerobic | Anaerobic
Gram stain | Positive | Positive | Positive
Optimal growth temperature | 37°C | na | na
Habitat | Human | Human | Human
Enzyme production | | |
Indole | | |
Alkaline phosphatase | | |
Urease | | |
Catalase | | |
Gelatinase | | |
Activity of phosphatase | | |
Acid phosphatase | | |
Naphthol-AS-BI-phosphohydrolase | | |
Saccharolytic enzyme | | |
Hyaluronidase | | |
α-glucosidase | | |
β-glucosidase | | |
β-glucuronidase | | |
Proteolytic enzyme | | |
Proline arylamidase | | |
Arginine arylamidase | | |
Pyroglutamyl arylamidase | | |
Histidine arylamidase | | |
Utilization of | | |
Glucose | | |
Mannose | | |
Lactose | | |
Raffinose | | |
Figure 4. Reference mass spectrum from A. pacaensis strain 9403502T. Spectra from 10
individual colonies were compared and a reference spectrum was generated.



[Figure 5: MSP dendrogram; axis: distance level. Entries, in the order shown:
Prevotella maculosa DSM 19339T, Prevotella intermedia DSM 20706T, Prevotella disiens
DSM 20516T, Prevotella corporis DSM 18810T, Prevotella stercorea DSM 18206T, Prevotella
multisaccharivorax DSM 17128T, Prevotella shahii DSM 15611T, Prevotella bergensis
DSM 17361T, Anaerotruncus colihominis 8305741, Peptostreptococcus anaerobius DSM 2949T,
Anaerococcus tetradius DSM 2951T, Anaerococcus prevotii DSM 20548T, Anaerococcus octavius
DSM 11663T, Anaerococcus pacaensis, Anaerococcus vaginalis DSM 7457T, Anaerococcus
senegalensis, Anaerococcus obesiensis, Anaerococcus hydrogenalis DSM 7454T, Anaerococcus
grossensis, Clostridium tetani type 1 1049_NCTC 279T, Clostridium perfringens DSM 756T,
Clostridium butyricum DSM 10702T, Anaerococcus murdochii DSM 21462T, Anaerococcus
lactolyticus DSM 7456T, Finegoldia magna DSM 20470T, Tissierella praeacuta DSM 5675,
Tissierella praeacuta 11014005.]

Figure 5. Dendrogram based on the comparison of the A. pacaensis strain 9403502T MALDI-
TOF reference spectrum and 26 other species of the order Clostridiales.
Genome sequencing and annotation

Genome project history 

The organism was selected for sequencing on the basis of its phylogenetic position and
16S rRNA similarity to other members of the genus Anaerococcus, and is part of a study
prospecting anaerobic bacteria in several deep clinical samples. It was the first genome
of Anaerococcus pacaensis sp. nov. and the 7th genome of an Anaerococcus species.

The GenBank accession number is CAJJ020000000 (CAJJ020000001-CAJJ020000053); the
assembly consists of 14 scaffolds with a total of 53 contigs. Table 3 shows the project
information and its association with MIGS version 2.0 compliance.

Growth conditions and DNA isolation 

A. pacaensis sp. nov. strain 9403502T (= CSUR P122 = DSM 26346) was grown on blood agar
medium at 37°C under anaerobic conditions. Eight petri dishes were spread, and the growth
was resuspended in 5 × 100 µL of G2 buffer. A first mechanical lysis was performed with
glass powder on the FastPrep-24 device (Sample Preparation system, MP Biomedicals, USA)
for 2 × 20 seconds. DNA was then treated with lysozyme (30 minutes at 37°C) and extracted
using the BioRobot EZ1 Advanced XL (Qiagen). The DNA was then concentrated and purified
with a QIAamp kit (Qiagen). The yield and the concentration were measured with the
Quant-it Picogreen kit (Invitrogen) on the Genios Tecan fluorometer at 15.7 ng/µL.

Genome sequencing and assembly 

A 3-kb paired-end library was pyrosequenced on the 454 Roche Titanium platform. This
project was loaded on a 1/4 region of a PTP PicoTiterPlate. 5 µg of DNA was mechanically
fragmented on the Hydroshear device (Digilab, Holliston, MA, USA) with an enrichment size
of 3-4 kb. The DNA fragmentation was visualized with the Agilent 2100 BioAnalyzer on a
DNA LabChip 7500, with an optimal size of 3.2 kb. The library was constructed according
to the manufacturer's 454 Titanium paired-end protocol. Circularization and nebulization
were performed and generated a pattern with an optimum at 604 bp. After PCR amplification
through 15 cycles followed by double size selection, the single-stranded paired-end
library was quantified with the Agilent 2100 BioAnalyzer on an RNA Pico 6000 LabChip at
91 pg/µL. The library concentration equivalence was calculated as 2.76E+08 molecules/µL.
The library was stored at -20°C until use.
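
The concentration equivalence reported above can be reproduced with a standard mass-to-molecule conversion. The sketch below is an illustration under stated assumptions: the text does not give the formula used, so the average mass of 328.3 g/mol per single-stranded nucleotide and the use of the 604 bp optimum as the mean fragment length are assumptions, and the function name molecules_per_ul is hypothetical.

```python
AVOGADRO = 6.022e23        # molecules per mole
NT_MASS_G_PER_MOL = 328.3  # assumed average mass of one ssDNA nucleotide


def molecules_per_ul(conc_pg_per_ul: float, mean_length_nt: float) -> float:
    """Convert a single-stranded library concentration to molecules per microliter."""
    grams_per_ul = conc_pg_per_ul * 1e-12
    grams_per_mol = NT_MASS_G_PER_MOL * mean_length_nt
    return grams_per_ul / grams_per_mol * AVOGADRO


# 91 pg/µL library with an assumed mean fragment length of 604 nt
value = molecules_per_ul(91, 604)
print(f"{value:.2e}")
```

Under these assumptions the result agrees with the reported 2.76E+08 molecules/µL.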

The library was clonally amplified with 0.5 and 1 cpb in 2 emPCR reactions per condition
with the GS Titanium SV emPCR Kit (Lib-L) v2. The yields of the emPCR were 10.46% and
11.53%, respectively, within the 5 to 20% range expected from the Roche procedure.
790,000 beads were loaded on the GS Titanium PicoTiterPlate PTP Kit 70x75 and sequenced
with the GS Titanium Sequencing Kit XLR70.

The run was performed overnight and then analyzed on the cluster with the gsRunBrowser
and gsAssembler (Roche). Overall, 221,117 passed-filter sequences generated 71.95 Mb
with an average length of 325 bp.

The 454 sequencing generated 607,067 reads (105.03 Mb) assembled into contigs and
scaffolds using Newbler version 2.7 (Roche) and Opera software v1.2 [42] combined with
GapFiller V1.10 [43]. Finally, the available genome consists of 14 scaffolds and 53
contigs, with a coverage of 44.9×.



Table 3. Project information

MIGS ID | Property | Term
MIGS-31 | Finishing quality | Non-contiguous finished
MIGS-28 | Libraries used | One 454 PE 3-kb library
MIGS-29 | Sequencing platforms | 454 GS FLX+ Titanium
MIGS-31.2 | Sequencing coverage | 44.9
MIGS-30 | Assemblers | Newbler 2.7
MIGS-32 | Gene calling method | Prodigal 2.5
 | Genbank ID | CAJJ020000000 (CAJJ020000001-CAJJ020000053)
 | Genbank Date of Release | April 21, 2013
MIGS-13 | Source material identifier | DSM 26346
 | Project relevance | Prospection of anaerobic isolates in clinical samples
Genome annotation

Non-coding genes and miscellaneous features were predicted using RNAmmer [44], ARAGORN
[45], Rfam [46], and PFAM [47]. Open Reading Frames (ORFs) were predicted using Prodigal
[48] with default parameters, but predicted ORFs were excluded if they spanned a
sequencing gap region. The functional annotation was achieved using BLASTP [49] against
the GenBank database [50] and the Clusters of Orthologous Groups (COG) database [51,52].

Genome properties 

The genome of Anaerococcus pacaensis strain 9403502T is estimated at 2.36 Mb long with a
G+C content of 35.05% (Figure 6 and Table 4). A total of 2,186 protein-coding and 72 RNA
genes, including 3 rRNA genes, 42 tRNA, 1 tmRNA and 26 miscellaneous other RNA, were
found. The majority of the protein-coding genes were assigned a putative function
(74.1%), while the remaining ones were annotated as hypothetical proteins. The properties
and the statistics of the genome are summarized in Tables 4 and 5. Table 6 presents the
difference in the percentage of genes assigned to each COG category between Anaerococcus
pacaensis and Anaerococcus prevotii DSM 20548. The proportions of COG categories are
highly similar between the two species. The maximum difference, for the COG category
"Carbohydrate transport and metabolism", does not exceed 1.94%. The distribution of genes
into COG functional categories is presented in Table 5.



Figure 6. Graphical circular map of the genome. From outside to the center: scaffolds are in grey
(unordered), genes on forward strand (colored by COG categories), genes on reverse strand (colored
by COG categories), RNA genes (tRNAs green, rRNAs red, tmRNAs black, misc_RNA pink), GC content
(black/grey), and GC skew (purple/olive).
Table 4. Nucleotide content and gene count levels of the genome

Attribute | Value | % of Total
Genome size (bp) | 2,360,033 | 100
DNA coding region (bp) | 2,071,031 | 98.86
DNA G+C content (bp) | 827,191 | 35.05
Total genes | 2,272 | 100
rRNA | 3 | 0.13
tRNA | 42 | 1.85
tmRNA | 1 | 0.04
miscRNA | 26 | 1.14
Protein-coding genes | 2,186 | 96.21
Genes with function prediction | 1,620 | 74.10
Genes assigned to COGs | 2,154 | 98.54

The total is based on either the size of the genome in base pairs
or the total number of protein-coding genes in the annotated genome.
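
Several of the percentages in Table 4 can be recomputed directly from the counts. The sketch below is only an illustration: the helper name pct is hypothetical, and the choice of denominator for each row (genome size, total genes, or protein-coding genes) is inferred from the table footnote and the reported values rather than stated row by row in the text.

```python
GENOME_SIZE_BP = 2_360_033
GC_BP = 827_191
TOTAL_GENES = 2_272
PROTEIN_CODING = 2_186
COG_ASSIGNED = 2_154


def pct(part: int, whole: int) -> float:
    """Percentage of part in whole, rounded to two decimals."""
    return round(100 * part / whole, 2)


gc_content = pct(GC_BP, GENOME_SIZE_BP)            # G+C bases over genome size
protein_share = pct(PROTEIN_CODING, TOTAL_GENES)   # protein-coding over total genes
cog_share = pct(COG_ASSIGNED, PROTEIN_CODING)      # COG-assigned over protein-coding
print(gc_content, protein_share, cog_share)
```

These three values match the 35.05, 96.21 and 98.54 listed in the table.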



Table 5. Number of genes associated with the 25 general COG functional categories

Code | Value | % of total (a) | Description
J | 158 | 6.91 | Translation
A | 4 | 0.17 | RNA processing and modification
K | 160 | 6.99 | Transcription
L | 167 | 7.3 | Replication, recombination and repair
B | 4 | 0.17 | Chromatin structure and dynamics
D | 46 | 2.01 | Cell cycle control, mitosis and meiosis
Y | 0 | 0 | Nuclear structure
V | 97 | 4.24 | Defense mechanisms
T | 72 | 3.15 | Signal transduction mechanisms
M | 93 | 4.06 | Cell wall/membrane biogenesis
N | 16 | 0.7 | Cell motility
Z | 3 | 0.13 | Cytoskeleton
W | 0 | 0 | Extracellular structures
U | 48 | 2.1 | Intracellular trafficking and secretion
O | 93 | 4.06 | Posttranslational modification, protein turnover, chaperones
C | 129 | 5.64 | Energy production and conversion
G | 148 | 6.47 | Carbohydrate transport and metabolism
E | 145 | 6.34 | Amino acid transport and metabolism
F | 70 | 3.06 | Nucleotide transport and metabolism
H | 76 | 3.32 | Coenzyme transport and metabolism
I | 54 | 2.36 | Lipid transport and metabolism
P | 157 | 6.86 | Inorganic ion transport and metabolism
Q | 23 | 1.01 | Secondary metabolites biosynthesis, transport and catabolism
R | 272 | 11.89 | General function prediction only
S | 253 | 11.06 | Function unknown
- | 32 | 1.4 | Not in COGs

(a) The total is based on the total number of protein-coding genes in the annotated genome.






Standards in Genomic Sciences 



Pagnier et al. 



Insights into the genome sequence 

We made some brief comparisons against Anaerococcus prevotii DSM 20548, which is
currently the closest available genome. The A. prevotii genome contains 1 chromosome
(accession number: NC_013171) and 1 plasmid (accession number: NC_013164).

The draft genome sequence of Anaerococcus pacaensis is larger than that of Anaerococcus
prevotii (2.36 Mbp and 1.99 Mbp, respectively). Its G+C content is slightly lower than
that of Anaerococcus prevotii (35.05% and 37.5%, respectively). Anaerococcus pacaensis
has more genes (2,272 genes against 1,916 genes); however, the number of genes per Mb is
very similar (962.71 and 962.81, respectively).
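
The gene density comparison above follows from dividing each gene count by the corresponding genome size. A minimal sketch, using the rounded genome sizes quoted in the text (the function name genes_per_mb is hypothetical):

```python
def genes_per_mb(gene_count: int, genome_size_mb: float) -> float:
    """Gene density in genes per megabase, rounded to two decimals."""
    return round(gene_count / genome_size_mb, 2)


pacaensis_density = genes_per_mb(2272, 2.36)  # A. pacaensis
prevotii_density = genes_per_mb(1916, 1.99)   # A. prevotii DSM 20548
print(pacaensis_density, prevotii_density)
```

Both densities come out within 0.1 gene per Mb of each other, in line with the values reported in the text.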



Table 6. Percentage of genes associated with the 25 general COG functional categories for Anaerococcus pacaensis
and Anaerococcus prevotii DSM 20548.

Code | COG description | A. pacaensis (%) | A. prevotii (%) | Difference (%)
J | Translation | 6.91 | 7.53 | -0.62
A | RNA processing and modification | 0.17 | 0.10 | 0.07
K | Transcription | 6.99 | 6.91 | 0.08
L | Replication, recombination and repair | 7.30 | 6.13 | 1.17
B | Chromatin structure and dynamics | 0.17 | 0.16 | 0.01
D | Cell cycle control, mitosis and meiosis | 2.01 | 1.56 | 0.45
Y | Nuclear structure | 0.00 | 0.05 | -0.05
V | Defense mechanisms | 4.24 | 3.43 | 0.81
T | Signal transduction mechanisms | 3.15 | 3.17 | -0.02
M | Cell wall/membrane biogenesis | 4.06 | 5.24 | -1.18
N | Cell motility | 0.70 | 0.36 | 0.34
Z | Cytoskeleton | 0.13 | 0.16 | -0.03
W | Extracellular structures | 0.00 | 0.00 | 0.00
U | Intracellular trafficking and secretion | 2.10 | 1.92 | 0.18
O | Posttranslational modification, protein turnover, chaperones | 4.06 | 3.63 | 0.43
C | Energy production and conversion | 5.64 | 6.59 | -0.95
G | Carbohydrate transport and metabolism | 6.47 | 8.41 | -1.94
E | Amino acid transport and metabolism | 6.34 | 6.65 | -0.31
F | Nucleotide transport and metabolism | 3.06 | 3.69 | -0.63
H | Coenzyme transport and metabolism | 3.32 | 3.58 | -0.26
I | Lipid transport and metabolism | 2.36 | 2.34 | 0.02
P | Inorganic ion transport and metabolism | 6.86 | 6.80 | 0.06
Q | Secondary metabolites biosynthesis, transport and catabolism | 1.01 | 0.78 | 0.23
R | General function prediction only | 11.89 | 11.21 | 0.68
S | Function unknown | 11.06 | 9.61 | 1.45
- | Not in COGs | 1.40 | 0.99 | 0.41
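
The difference column of Table 6 is the A. pacaensis percentage minus the A. prevotii percentage. The sketch below recomputes it for a subset of categories (the five with the largest absolute differences in the table) and picks out the largest one; the dictionary name cog_pct and the choice of subset are illustrative only.

```python
# (A. pacaensis %, A. prevotii %) for a subset of COG categories from Table 6
cog_pct = {
    "J": (6.91, 7.53),
    "L": (7.30, 6.13),
    "M": (4.06, 5.24),
    "G": (6.47, 8.41),
    "S": (11.06, 9.61),
}

# Difference = A. pacaensis minus A. prevotii, rounded as in the table
diffs = {code: round(pac - pre, 2) for code, (pac, pre) in cog_pct.items()}

# Category with the largest absolute difference
largest = max(diffs, key=lambda code: abs(diffs[code]))
print(diffs)
print(largest, diffs[largest])
```

The largest absolute difference falls on category G (Carbohydrate transport and metabolism), matching the 1.94% maximum stated in the genome properties section.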



Conclusion 

On the basis of phenotypic, phylogenetic and genomic analyses, we formally propose the
creation of Anaerococcus pacaensis sp. nov., which contains the strain 9403502T. This
bacterium was found in Marseille, France.

Description of Anaerococcus pacaensis sp. nov. 

Anaerococcus pacaensis (pa.ca'en.sis L. gen. masc. n. pacaensis, of PACA, the acronym of
Provence Alpes Cote d'Azur, the region where Anaerococcus pacaensis was isolated).
Isolated from a blood sample from a patient from Marseille. A. pacaensis is a
Gram-positive, obligately anaerobic, non-spore-forming coccus. Grows on axenic medium at
37°C in an anaerobic atmosphere. Negative for indole. Non-motile. The G+C content of the
genome is 35.05%. The type strain is 9403502T (= CSUR P122 = DSM 26346).